CDS

Accession Number TCMCG009C06823
gbkey CDS
Protein Id XP_030503611.1
Location complement(join(48279792..48279956,48280076..48280360,48280676..48280891,48282436..48282642,48283214..48283496,48283593..48283771,48283869..48284180,48285997..48286053,48286181..48286434,48286718..48286795,48288122..48288461,48288552..48288802,48289457..48289628,48289740..48289817,48289906..48289968,48290232..48290330,48290428..48290583))
Gene LOC115718925
GeneID 115718925
Organism Cannabis sativa

Protein

Length 1064aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA560384
db_source XM_030647751.1
Definition protein ALWAYS EARLY 2 isoform X5 [Cannabis sativa]

EGGNOG-MAPPER Annotation

COG_category K
Description Protein ALWAYS EARLY
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
KEGG_ko ko:K21773        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04218        [VIEW IN KEGG]
map04218        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCTCCAACCAAGAAATCTCGGGTGAACAAGCGGTATGGAAGTGCAATTTCTTATTCTCCGGAGAAAGAGGTAGGAAATTCGGCTAAGAACAAGCAACGCGGTTCAATTTCCTTTTCTCCTGATGGCGGGAACTCCAACAAAAGCCTTCAAAGAAAGAAAAAACTATCAGATAAGTTGGGGTCTCAGTGGAGCAAGGGAGAGCTTGAGCGTTTTTATGATGCTTATCGGAAGTATGGAAAAGACTGGAGGAAGGTTGCTGCTGCAGTGCGCAACAGAACTGTCGAAATGGTTGAGGCTCTTTACAGTATGAATCGGGCATACTTGTCGCTGCCAGAAGGAACAGCTTCTGTTGTTGGCCTTATAGCAATGATGACAGATCACTATAATGTTCTGGAAGGGAGCGATAGTGACCAAGAGAGCAATGATGGTTCAGGAATTTCTCGAAAACTTCCAAACCGCAAGCGTGGCAGAGATAATTTTAACACCTCAAAGGATCTTTTCCAATCTCACTCAGTTGCTTCTACTGATGGATGCTTATCATTACTCAAAAGGAAACGCAGTGATGGTAGCCAGCCTCGTGTAGTTGGAAAGCGGACACCACGCTTTCCAGTTTCATACTCACATAAGAGAGATTATAGAGAAAATCAAATGTCACCCAATAGGAAGGGCAAAAAGGCTGAGAATGATAATGATGATGCACATGTAGCAGCATTGGCATTGACTGAGGCTTCTCAAAGAGTAGGTTCCCCTCAAGTTTCTACACCATACAAGCATATCAGCTCTTCTCCTGCTCAAAGCTGGGAAAGATTAGCACTAAGCTCACAAAGACTCCGTGATACTTCTGTGGATGACGATTGGTTTGAAGGTAGTGTTGGAAGCAGGGGAGCTGAAAATGGAGATTATGGGAAGGATACTAGCTCCTTGATGGCTATGGAAGGCGTTGGTACAGTTGAGGTTCATCAAAAGGGGAAGAAGTTTTATGGAAAAAAAGAGAAAGTTGAGGATGATGATGGTGGGGAAGCATGTAGTGGCACAGAAGAAGGGAAAAGTATCAGCGGTTTGAAGGGAAAAATTGGTGTTGAATTTTCAAATGCGAAAGTTGAGCGAGTTTCTCCGCAAAGTCAACGGAAGAAAAGCAAAAAGTTATTTTTTGGAGATGAAAGCTCTGCCATTGATGCTCTTCAAACGTTGGCTAATTTGTCGCTCATGATGCCTCCATGTACCGTGGAATCTGAATCGTCTGTCCAATTGAAGGAAGAAAGAACAACACTAGATGCAGAAGAAAAAAGCAGTGTACCTGAAGCGACAAGCCAAGTTAGAAACAAAAATAAACTCTCAGGTGCTAAACAAAAGGCACCCCCTACAATTTCCAAAAAGTCCACACTTGGAAGGGATGCAAATGGTGATATTATTAATGATTCAACAACGGGACAATTTCTTTCTGAGAACAAATTATTGAAAAGGAGGCGGAAGTCCTCGATACCAAAGATGTCAAAAGTAGAAGCTCGTTTAGATGCTATTTTAAAGGGAACTTTTAAGACCGAGGTTATTTGTGAAGAGGAGAGTAAACCAGTGATTAAAGGTAAACGGAGTAGCCAATCTTCTACTCCTTCAAAACAGTGGAAATCAGTAGGATCATCTGAAGGTTCTTTGAGTGGTGAATTTAAAAGAACTGGAAGTGGCACTGATTTAGCTGTATCTACTACTCAAGTTCCTGCTGCCAGCCAAGTTAACTTACCAACTAAGCAAAGAAGTAAGCGCAAAATGTATCTACCCCAAACATTGCCCACCAAAGACATAAAGTCTACGCAGAATATTGTTAAACGGAAGGTTAACAAAAATTTCACTTTGCCAGAGGAGAAACTTTCTTGTTTCCTATCATCAACTATGGTACGGAGATGGTGCACATTTGAGTGGTTTTATAGTGCGATAGATTACCCTTGGTTTGCCAAGAGGGAATTTGAGGAGTACTTAAACCATGTTGGATTGGGGCACATCCCAAGGTTAACTCGGGTTGAATGGGGTGTCATTAGAAGTTCCCTTGGAAAACCTCGGAGGTTTTCTGAACTCTTTCTTCGTGAAGAAAGGAAGAAACTTAAACAATATCGAGAATCTGTTAGAGAGCATTATACTGAACTCCGCAATGGAGTTAGGGAAGGTCTCCCTACAGATTTAGCGAGACCTCTTACAGTTGGACAACGGGTGATTGCTATACATCCAAAAACTAGAGAAGTTCACGATGGAAGTGTGCTTACAGTTGATCATGACAAGTGCAGGATTCAGTTTGACCATCCTGACATAGGTGTTGAATTTGTCATGGATGTTGACTGCATGCCTTCAAACCCAATGGAGAATATGCCAGAATCTCTTAGGAGACAGAACAGTACGATTGACAATTTTTCTCTTACATCTAAAGAGCCACTACCGAATGGGAATCTGAACTTTGGAGGGCCTTTGATGTTTGCTTCAAGTGGGCACATGGAGAAAGGACCTACATCTATCAATACCTTGGGAAAGCATGGAAAGGTTTCTTCAGCTTTGCATGATTTGAGGCAACGTAATACTCATCCGGGAAATGTCTTGTTCCCTGGTCCGAAGATCCCTATCAATTCCACTACCCACAGTAACGTTCCCAGTTCATTTGATAACTTTTCCATTTCTCAAGACTCAGCATCTAATCTCATTGAAATTGTTAAAGGATCGACAATAAAAGCACAATCTATGGTTGATGCTGCTATTCAGGCATTTTCATTGTGTAAGGAAGGGGAAGATGCGTATCTAAAGATTAGAGAAGCTCTAGACTCCATGGATAACAAGTTGATGACGTCTGAGTCCAGAGCATTAACAAATAAACCTCATGAGCAGGTCAATGGAACATCAGTCCATCGCTATTCGCTGATAAAATCAGAGCCTGTCATCACGGGTGATTCATCTGCTTCTAATTTGCGTACAGATTCTGACAAAAGTGAGGCAAAAGTACCTTCAGGCATCATCACTTCATGTGTTGCTAGTTTGCTCATGATACAGACATGTACAGAACGACAATATCCTCCATCTGAAGTGGCTCAAATATTAGATACAGCCGTCACAAGATTGTATCCTTTGTCTTCTCAAAATATACAAATATACAGAGAAATACAATCGTACATGGGTAGAATCAAGACTCAAATATTAGCCCTTGTACCAACTTGA
Protein:  
MAPTKKSRVNKRYGSAISYSPEKEVGNSAKNKQRGSISFSPDGGNSNKSLQRKKKLSDKLGSQWSKGELERFYDAYRKYGKDWRKVAAAVRNRTVEMVEALYSMNRAYLSLPEGTASVVGLIAMMTDHYNVLEGSDSDQESNDGSGISRKLPNRKRGRDNFNTSKDLFQSHSVASTDGCLSLLKRKRSDGSQPRVVGKRTPRFPVSYSHKRDYRENQMSPNRKGKKAENDNDDAHVAALALTEASQRVGSPQVSTPYKHISSSPAQSWERLALSSQRLRDTSVDDDWFEGSVGSRGAENGDYGKDTSSLMAMEGVGTVEVHQKGKKFYGKKEKVEDDDGGEACSGTEEGKSISGLKGKIGVEFSNAKVERVSPQSQRKKSKKLFFGDESSAIDALQTLANLSLMMPPCTVESESSVQLKEERTTLDAEEKSSVPEATSQVRNKNKLSGAKQKAPPTISKKSTLGRDANGDIINDSTTGQFLSENKLLKRRRKSSIPKMSKVEARLDAILKGTFKTEVICEEESKPVIKGKRSSQSSTPSKQWKSVGSSEGSLSGEFKRTGSGTDLAVSTTQVPAASQVNLPTKQRSKRKMYLPQTLPTKDIKSTQNIVKRKVNKNFTLPEEKLSCFLSSTMVRRWCTFEWFYSAIDYPWFAKREFEEYLNHVGLGHIPRLTRVEWGVIRSSLGKPRRFSELFLREERKKLKQYRESVREHYTELRNGVREGLPTDLARPLTVGQRVIAIHPKTREVHDGSVLTVDHDKCRIQFDHPDIGVEFVMDVDCMPSNPMENMPESLRRQNSTIDNFSLTSKEPLPNGNLNFGGPLMFASSGHMEKGPTSINTLGKHGKVSSALHDLRQRNTHPGNVLFPGPKIPINSTTHSNVPSSFDNFSISQDSASNLIEIVKGSTIKAQSMVDAAIQAFSLCKEGEDAYLKIREALDSMDNKLMTSESRALTNKPHEQVNGTSVHRYSLIKSEPVITGDSSASNLRTDSDKSEAKVPSGIITSCVASLLMIQTCTERQYPPSEVAQILDTAVTRLYPLSSQNIQIYREIQSYMGRIKTQILALVPT